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Abstract 



We revisit the one-dimensional stochastic model of Lubensky and Nelson 
[Biophys. J 77, 1824 (1999)] for the electrically driven translocation of 
polynucleotides through a-hemolysin pores. We show that the model cor- 
rectly describes two further important properties of the experimentally ob- 
served translocation time distributions, namely their spread (width) and 
their exponential decay. The resulting overall agreement between theoreti- 
cal and experimental translocation time distributions is thus very good. 

Key words: Nanopores; translocation; Stochastic modeling; Brownian 
motion; a-Hemolysin 
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Introduction 

The translocation of biopolymers such as DNA, RNA, or polypeptides through 
protein pores plays a key role in various cellular processes yj). Apart from 
these biological systems, also artificial, so-called solid-state nanopores have 
recently attracted a lot of attention due to their promising potential as a 
new generation of fast and cheap DNA sequencing devices and other medi- 
cal diagnostics applications (0)- To achieve such goals, many experimental 
problems still have to be solved, and also the theoretical understanding and 
control of those fundamental transport processes needs substantial further 
development. 

Here, we reconsider one of the earliest and best established theoreti- 
cal models in this context, originally introduced in 1999 by Lubensky and 
Nelson (0), and further studied and developed in numerous subsequent 
works, see e.g. Motivated by the seminal experiments on polynu- 



cleotide translocation through an a-hemolysin pore by Kasianowicz et al. 
(fl2l). Lubensky and Nelson proposed a theoretical description in terms of 
a one-dimensional stochastic model dynamics in a tilted periodic potential 
(0) . While many features of the experimentally observed translocation time 
statistics could indeed be explained remarkably well by their simple model, 
the theoretical spread of the translocation times underestimated the exper- 
imental one by about two orders of magnitude (0). This discrepancy was 
pointed out once again in the review paper but to the best of our knowl- 
edge has remained a tacitly ignored problem of such a model ever since. To 
resolve this problem is a first main issue of our present work. 

Since the quantitative details and sometimes even the qualitative findings 
notably depend on the considered pores and polymers, we follow Lubensky 
and Nelson in specifically focusing on the experimentally best studied case of 
the a-hemolysin protein pore and polynucleotides of single stranded DNA or 
RNA. In particular, for this system the translocation time distributions are 
quantitatively quite well documented, not only with respect to their above 
mentioned spread but also with respect to their decay for large times 
16l ). The second main point of our paper is that the model of Lubensky and 
Nelson also correctly reproduces the experimentally observed exponential 
decay. 

The overall result is a very good comparison of the complete theoret- 
ically predicted translocation time distributions with experimentally ob- 
served data sets. 
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Experimental System 

The basic experimental set up is illustrated in Fig. 1. Charged, single- 
stranded polynucleotides (DNA or RNA) in aqueous solution are exposed 
via electrodes to an externally applied voltage. Two fluid compartments 
are separated by a phospholipid membrane and are connected by a sin- 
gle a-hemolysin protein pore. Since the phospholipid membrane is non- 
conducting, practically the entire voltage drop occurs within the pore and its 
immediate neighborhood. Whenever a polynucleotide diffusively approaches 
the pore from the "upper" side in Fig. [H the electrical forces direct it into 
the pore and drive it to the other side of the membrane. Every such translo- 
cation process is experimentally observable as a reduction of the electrical 
current through the pore. Even though the polynucleotides are (practically) 
identical, the durations of the current blockades exhibit quite significant 
statistical variations. The main theoretical task is to qualitatively explain 
and quantitatively model the experimentally observed translocation time 
distributions. For further details, see, e.g., (jlMld). 



Model 



According to Lubensky and Nelson (0), the polymer translocation process 
is modeled by means of a single dynamical state variable x(t) (slow/relevant 
collective coordinate), defined as the contour length of that part of the 
polymer chain which already has passed through the pore until time t. In 
particular, hydrodynamic (dissipative) and steric (entropic) effects of the 
chain segments outside the pore and its immediate neighborhood are con- 
sidered as negligible. The most immediate justification of this approxima- 
tion is that otherwise a disagreement with the experimentally observed lin- 
ear de pen dence of the mean translocation time upon the polymer length 
113, UM) seems practically unavoidable This general 

fact is nicely illustrated e.g. in Ref. (j22l ) by means of a model very similar 
in spirit to the one by Lubensky and Nelson, but in addition taking into 
account the polymer degrees of freedom far from the pore region within an 
approximative, accompanying equilibrium description originally due to (|23l). 

The state variable x(t) is subjected to several kinds of forces, most no- 
tably due to the externally applied voltage and the electrostatic, mechanical, 



1 While this experimental finding is beyond any doubt in the case of polynucleotide 



translocation through a-hemolysin pores 
results in the case of solid-state nanopores 
the Lubensky-Nelson model may be inappropriate in such a case. 



Ld any doubt m thi 

Jul M H E, E3, 

are contradictory 



), the corresponding 
l21f). For this reason 
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and chemical interaction of the polymer with the pore walls, but also due to 
entropic forces within the pore and its immediate neighborhood, generated 
by the numerous microscopic degrees of freedom of the ambient solvent, the 
pore, and the polymer itself. All those forces can be considered to arise as 
minus the derivative of a free-energy type potential of mean force The 
remaining effects of the fast molecular degrees of freedom are approximately 
modeled as friction (dissipation) and noise (thermal fluctuations), while in- 
ertia effects are usually negligible on those small lengths and velocity scales. 
Altogether, we thus arrive at an over damped Langevin dynamics of the well 



established form (|24j, |25| ) 



rfx(t) = -$'(x(t)) + ^J%kTt{t) , (1) 

where £(t) is a delta-correlated Gaussian white noise, r\ is the friction coef- 
ficient, and kT the thermal energy. 

In the simplest case of a homopolymer, the force —$>'(x) remains invari- 
ant when the entire polymer is translocated by the length a of one monomer, 
i.e. $>'(x + a) = $'(x) for all x. As a consequence, 3>(x) must be a tilted pe- 
riodic potential, consisting of a strictly a-periodic part U{x) and a constant 
"tilting" force F, 

$(x) = U{x) - Fx . (2) 

Advancing the polymer by one monomer length a changes its (free) en- 
ergy by $(x + a) — $(x) = —aF according to Eq. [2j Following Lubensky and 
Nelson (0), the same change of state is obtained by moving one monomer 
from one to the other end of the polymer chain. The energy required for 
such a move is qV, where q is the charge of a monomer and V the externally 
applied voltage (with sign convention as indicated in Fig. [1]). We thus can 
conclude that p) 

aF = -qV. (3) 
We remark that the the nominal charge per nucleotide is equal to 

q e = -1.602... • 1CT 19 C (electron charge). (4) 

However, it is by now well established (0-Eil) that due to various electroki- 
netic effects of the ambient ionic solution and the pore (screening, electroos- 
mosis, electrophoresis, polarization and field confinement mechanisms), the 
relevant effective charge q in Eq. [3] is reduced by roughly a factor of 10 
compared to the nominal (bare) charge from Eq. HJ i.e. 



q w 0.1 q e 



(5) 
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Due to the above mentioned divers effects which contribute to the charge 
renormalization, the exact value of q depends, among others, on temperature 
and ion concentrations, but also on the specific monomer (nucleotide) of 
which the polynucleotide is composed. 

Under the assumption that V, a, and q are (approximately) known, 
the force F in Eq. [2] is thus fixed through Eq. [3l Much more difficult to 
theoretically estimate from first principles are the friction coefficient rj in Eq. 
[H and the periodic potential U(x) in Eq. [2j They may thus be considered 
as a model parameter and a model function, respectively, which remain to 
be determined by experimental means. 



Velocity and Diffusion 

As a first quantity of interest we consider the average translocation velocity 
v of the polymer through the pore. Focusing on not too short polymers, 
"boundary-effects" while the polymer enters and exits the pore are negligible 
and v follows as the time- and ensemble-averaged velocity x(t) from the 
model in Eq. [I] with an infinitely extended periodic potential U{x) in Eq. [21 
The analytical solution of this problem goes back to Stratonovich |32) and 
has subsequently been rederived many times, see e.g. chapter 11 in (0). 



Adopting the notation from (|33l . |34| ) , this solution takes the form 



kT 1 - e~ aF / kT 



where we have introduced 



= e 4>(x)/fcT f X e -Hv)/vr 

J x—a a 



(7) 



A further quantity of interest is the random spread of the transloca- 
tion velocity (and thus of the translocation time) about its mean value v, 
quantified by the diffusion coefficient 

D=1 , m {[x( t )-x(0)- rf p> 

t— >oo 2t 

where (•) indicates an average over the noise £(i) in Eq. Q] and over the 
initial positions x(0). 

Similarly as for the velocity v, the analytical result for the diffusion 
coefficient in a tilted periodic potential, Eqs. [Q [21 has been independently 
obtained several times. To the best of our knowledge, the first closed, exact 



Polymer translocation through nanopores 



expression for D is buried in the paper (|35l ). For the second time, the same 



problem was solved again by Lubensky and Nelson, see Appendix B in 



Further rediscoveries are due to (|36l ) and ([33j, |34| ) . While all those results 
are of course equivalent, the actual formulae for D are quite different and, 
with the exception of (3J, 34), also quite involved. For this reason, the one 



from (|33l . |34j) is most common, reading 



D _kT £f P(x)J(x) 



where we have introduced 

J(x) = e-*W kT r +a< ^-e^' kT . 



The main quantity of interest later on will be the dimensionless ratio 
av/D (cf. Section "Spread of Translocation Times" and Ref. (0)), given 
according to Eqs. [6] and [9] by 



D K%P(z)J(x) 

A first main result of our paper consists in the observation that the 
leading order behavior of Eq. [11] for small values of aF/kT takes the form 

- - — , (12) 
D kT ' K ' 

independently of any further details of the periodic potential U(x). In the 
simplest case, this result follows by expanding the left bracket in the nu- 
merator of Eq. [TT]to first (=leading) order in aF/kT and evaluating all the 
remaining integrals for aF/kT = (leading=zeroth order in aF/kT). In 
doing so, the integrals in Eqs. \7\ and [10] become x-independent and, as a 
consequence, the denominator in Eq. [TT] becomes equal to the right bracket 
in the numerator. An analogous (but more tedious) expansion resulting in 
PT2l is also contained in (0). Both expansions, however, become question- 
able in the weak noise limit (small thermal energy kT), since the expansion 
coefficients in general will inherit exponentially large values from the inte- 
grands exp{±[U(x) — U(y)]/kT} contributing via Eqs. El E [8]to the multiple 
integrals in 1111 

Our first remark is that Eq. [12] in fact still remains valid for asymptoti- 
cally weak noise (kT — > 0) and not too large F-values, so that the dynamics 
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in Eqs. HJ [2] is governed by rare, thermally activated transitions between 
metastable states: In this case, the dynamics can be approximately de- 
scribed by a one-dimensional random walk between discrete sites at distance 
a with certain forward and backward hopping rates r + and r_, respectively. 
As a consequence (0), one obtains v = a(r+ — r_) and D = a 2 (r + + r_)/2. 



Furthermore, detailed balance symmetry (|25l ) implies for the forward and 
backward rates the relation r + /r_ = exp{aF/kT}. We thus obtain the 
asymptotically exact result 

av 1 - e - aF / kT 

- = 2 1 - e _ aF/kT = 2 tanh(aF/2fcT) , (13) 

independently of any further details of U{x). In particular, for small aF/kT 
one readily recovers Eq. [12j 

Our second remark is that both for asymptotically large F and for 
asymptotically large kT, the effects of the periodic potential U(x) in Eq. 
[2] become negligible (jsl, with the consequence that the exact expression 
in Eq. [11] approaches once again the asymptotics of Eq. [12j The same 
conclusion is of course recovered if the variations of the periodic potential 
U (x) itself become negligibly small. 

Our last remark is that the diffusion coefficient D as a function of the 
tilt F develops an arbitrarily pronounced maximum for sufficiently small 



kT, see (|33l . l34l). Nevertheless, closer inspection along the lines of (|33l . l34l) 
shows that the ratio D /av remains a strictly decreasing function of F within 
the neighborhood of the maximum of D. 



Translocation time distribution and exponential de- 
cay 

A "successful translocation event" starts when the polymer enters the pore 
from one side and ends when it exits at the other side. In contrast, if 
the polymer exits at the same side as it entered, we are dealing with an 
"unsuccessful translocation attempt". Following Lubensky and Nelson, we 
ignore unsuccessful attempts and henceforth only consider the successful 
events. Regarding the experimental identification of such events see e.g. 
(fl6l ) . The statistical distribution of their duration is the quantity of foremost 
interest from now on. 

A main achievement of Lubensky and Nelson's work (Q) is an analytical 
approximation for the distribution (probability density) ip(t) of transloca- 
tion times. In fact, the approximation becomes asymptotically exact for 
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sufficiently large numbers N of monomers, i.e. 

N = L/a » 1 , (14) 

where a and L denote the lengths of one monomer and of the entire polymer, 
respectively. 

Remarkably enough, but also quite plausible at second glance, the only 
parameters entering the translocation time distribution ip{t) are the polymer 
length L and the velocity v and diffusion coefficient D of the corresponding, 
infinitely extended dynamics. Furthermore, it is convenient to employ the 
rescaled, dimensionless time 

vt , . 

r = — (15) 

and the dimensionless auxiliary parameter 



ana- 



Referring to Appendix A of (0) for the detailed calculations, the final 
lytical expression provided by eq. (A6) in (0) and can be rewritten in the 
form 



n 2 1 

C 



t3/2 ft* cxpf 2 ^ I {T ~ n)2 } 

n=l,3,5,... exp | K -t- KT J 

where the normalization constan c is given bjH 



c=^=[l-.-V"]. (18) 

In other words, the translocation time distribution ip{t) actually does not 
depend separately on all three parameters L, v, and D, but only on the two 
specific combinations v/L and k = 4D/vL. 

The behavior of Eq. [171 for large t is not obvious at all. In particular, for 
large r the summands on the right hand side are negative up to quite large 
n-values, while those for even larger n are positive. In view of the property 
ip(t) > and the expected asymptotics ip(t) — > for t — > oo, there must 
be a very fragile cancellation of positive and negative summands. On the 
other hand, we observe that, according to eq. (A2) in (Q), an asymptotically 
exponential decay of the right hand side in Eq. [IT] may be conjectured with 
a decay rate 

A = (7t/2) 2 k + 1/k . (19) 



2 There is a typo on the right hand side of eq. (A6) in (0): the first factor 2 should be 
replaced by 1/2 
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In view of this surmise, we thus may rewrite Eq. [17] in the following equiv- 
alent form 

i;(t) = c 9(kt) e~ XT , (20) 

where c' is another normalization constant, and where the auxiliary function 
g(x) is defined as follows 

ztt 2 /4 ni _ 1 

g(x) = 0.45731 -^ £ ^ . (21) 

n=l,3,5,... 

As can be seen from Fig. [2] the function g{x) converges towards a finite 
limit for x — > oo and the specific numerical factor 0.45731 in Eq. [21] has 
been chosen so that this limit is (practically) unity. 

This brings us to the next main result of our paper: According to Eq. 
[2D] and Fig. [2] the distribution of translocation times ip(t) predicted by the 
model of Lubensky and Nelson exhibits an exponential decay for large times, 
igreement with the experimental findings from ( 13l- lH). 



m ag 

We finally remark that equation [20] together with Fig. [2] and Eqs. [T5l 
1~6| PT9l provide a quite detailed qualitative picture of the translocation time 
distribution tp(t), and how it depends on the parameters v/L and k: Initially, 
ip{t) remains close to zero, then increases quite steeply up to a maximum 
at t max = T max L/v, where T max solves ng'(KT max ) = Xg(KT max ), and finally 
approaches an exponential decay. 

Numerically, for any given set of parameters v/L and k the quantitative 
evaluation of ijj(t) according to Eqs. [T5II18I is straightforward. In particu- 
lar, for k — > (vanishing thermal fluctuations) one recovers the expected 
deterministic limit ip(t) — > S(t — L/v). Typical examples will be presented 
in Section "Comparison with Experiments" below. 



Spread of translocation times 

Concerning a quantitative comparison between the theoretical prediction 
from Eq. [T7] and experimental data, we first observe that there are two 
fit parameters: One (namely L/v in Eq. [T5|) amounts to a quite trivial 
rescaling of time, which can be readily fixed e.g. by fitting the theoretical 
peak of ip(t) to the experimentally observed one. The remaining, second 
fit parameter is k from Eq. [T6] which can be determined by the following 
very convenient procedure originally due to Lubensky and Nelson 0): In 
a first step, the peak position t max of the experimentally observed ip(t) is 
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determined, formally defined via if)'(t max ) = 0. Then the two times ti and 
tR to the left (L) and to the right (R) of t max are determined according to 

Tp(t L ,R) = e~ 1/2 i>(t max ) ~ 0.606 • i>(t max ) . (22) 

In other words, tR — ti quantifies the width of the experimentally observed 
peak of tp(t). Observing that the ratio (tR — ti)/t max is independent of the 
chosen units of time, we can readily recast the approximative relation from 
Fig. 3 of Lubensky and Nelson's work (0) into the form (tR — ti)/t max ~ 
y/2n. In fact, this approximation becomes exact for asymptotically small 
(*R — tLj/tmax arid remains quite accurate as long as (tR — t£)jt max < 1. In 
other words, we obtain the quite accurate estimate 

k*U t -^) 2 provided < 1 . (23) 

2 \ tmax J t max 

On the other hand, combining Eqs. I3"| \12 \ [T4" | and 1161 yields the relation 

kT kN kN , 

— provided — — > 1 . (24) 



(-q)V 



From the experimental data in Fig. 2 of Kasianowicz et al. (1121 ) one 
readily reads off (tR — tL)/t max ~ 1 (0), implying with Eq. [23l th at k ~ 0.5. 
Taking into account that N = 210 in the experiments from (|12l ). the right 
hand side of Eq. [241 amounts to kN/A ~ 26. Hence the condition nN/4 > 1 is 
fulfilled and we can apply Eq. [24]to conclude that kT/qV ~ 26. On the other 
hand, using T ~ 293 K (room temperature), V = 120 mV (experimental 
voltage from (fl2l V). and the nominal charge q = q e ~ —1.6 • 10~ 19 C per 
nucleotide from (J3j), we obtain the result kT/(—q)V ~ 0.21. In view of 
the discprepancy with the relation kT/(—q)V ~ 26 following from Eq. [24"1 
Lubensky and Nelson concluded that their model, Eq. [TJ which implied Eq. 
24"! was inconsistent with the experimental facts. 

In the following, we argue that this conclusion is not tenable. Rather, 
the prediction from Eq. [23] of the model, Eq. [H agrees quite well with the 
experimental findings. To this end, we first evaluate the right hand side 
of Eq. [Ml by means of the results from several more recent, and therefore 
possibly more accurate experiments than in the original work of Kasianowicz 
et al. (|12l): From the two data sets of Meller at al. displayed in Fig. 2 of 
their work (fl3l ) (see also Fig. 3 in (14) and Fig. 6 in (jl5l )) one can infer 



with Eq. [23]that k — 0.09 and k ~ 0.11, respectively (see also next Section). 
With N = 100 (H3) it follows that kN/4 ~ 2.3 and kN/4 ~ 2.8, respectively. 
Likewise, one can infer from the data in Fig. 2 of Bates et al. (j37h that 
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K — 0.17 and hence with N = 60 that nN/4 ~ 2.6. Finally the data from 
Fig. 5 of Butler et al. ^) imply that n ~ 0.19 and with N = 50 that 
KiV/4 ~ 2.4. 

We remark that in all those experiments homopolymers have been used 
and that we do not know of any further data of this kind (histograms of 
translocation durations for homopolymers) in the literature. For the sake 
of completeness, we may also include here the data for heteropolymers with 
N = 92 from Fig. 2c in Maglia et al. ((H), yielding k = 0.13 and hence 
kN/4 ~ 3.0. 

Turning to the quantity kT/(— q)V appearing on the left hand side of 
Eq. [24"t we observe that the voltage V = 120 mV, and the temperature 
T ~ 293 K was (approximately) the same in all the experiments from ([lj 



19 . 1371 . l38l ). Using the nominal charge q = q e per nucleotide from 



one thus recovers the same result kT/(—q)V ~ 0.21 as before, and hence 
Eq. [24] still seems to be violated. However, according to ()26l 43ll) a much 
more realistic estimate follows by employing the appropriately renormalized 
effective charge from Eq. [5l namely kT/(—q)V ~ 2.1. As a consequence, 
the theoretically predicted relation in Eq. [2^1 is satisfied quite well by all 
the more recent pertinent experiments 

Taking the model of Lubensky and Nelson and thus Eq. [2H for granted, 
the above findings for kN/4 may now in turn be used to estimate the renor- 
malized charge q more accurately than in Eq. Since temperature, voltage, 
buffer etc. were almost the same in all cases, the resulting differences in q 
must be mainly due to the different nucleotides (see also below Eq. Qp. 

With respect to the earlier experiment by Kasianowicz et al. (|12l). a 
further reduction of the effective charge by another factor of 10 would be 
a possible, though not very satisfying, explanation (see also next Section). 
A more likely reason seems to be connected with the considerably larger 
kN/4- value compared to the more recent experiments. In other words, the 
experimentally observed spread of the translocation times is unusually large. 
Indeed, one generally expects that the experimentally observed spreads of 
the translocation times still somewhat overestimate the purely diffusive ef- 
fects accounted for in the theory. One possible reason of why the observed 
spread was particularly large in (fl2l ) may be that the data analysis and 
pre-processing according to their detailed current blockade signatures was 
not yet as sophisticated as e.g. in (13-2). Another possible factor is the 
smaller measurement bandwidth of about 24 KHz for the experiments 0), 
compared to 100 KHz for Ezl) and 50 KHz for (0). As will be ar- 

gued in the next section, the most plausible explanation is a relatively large 
spread of the polynucleotide lengths of the samples used by Kasianowicz et 
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al. id)- 

Comparison with experiments 

The purpose of this Section is a comparison between the complete transloca- 
tion time distributions for several of the experiments already considered in 
the previous Section and the corresponding theoretical distributions. Quite 
surprisingly, such a detailed quantitative comparison does not seem to exist 
in the previous literature known to us. 

As a first example, Fig. 3 presents the experimental data reported by 
Butler et al. 

0) 

for single-stranded RNA rCso polynucleotides (i.e. iV = 50 
in Eq. [T4"|) . The theoretical translocation time distribution has been ob- 
tained as described in the previous Section: First, one readily reads off from 
the experimental data points that (£r — ti)jt max ~ 0.61, yielding with Eq. 
[23] the estimate k ~ 0.19. Then, the time-scale parameter L/v in Eq. [15] is 
adapted so as to optimally fit the peak position of the experimental data, 
resulting in the estimate L/v ~ 0.2 ms. According to Fig. [3] the theoretical 
ip(t) obtained in this way agrees very well with the experimental findings 
with the exception of the times t smaller than about 0.1ms. In fact, the 
corresponding experimental data were denoted in Ref. (fl6l ) as "ambiguous 
signals" , possibly caused by "retraction of the threaded configuration back 
into the vestibule configuration, very rapid translocation, or translocation of 
short polynucleotide fragments. Due to the ambiguity in the interpretation 
of these short Deep states, we only designated Deep states with durations 
longer than the minimum as translocations." This ambiguity in the inter- 
pretation of the experimental current blockades ("Deep states") seems to us 
a sufficiently convincing explanation of the deviations from the theoretical 
curve at small times t. An apparently rather similar situation for small t 



has in fact been discussed already in (J3|, [l_2|, |42j) . In view of this ambiguity 
for £ < 0.1 ms we fitted in Fig. [3] the scaling factor c of the theoretical curve 
[m as well as possible to the experimental data for t > 0.1ms rather than 
normalizing it according to 1181 



As a second example, Fig. [4]shows the data of Bates et al. (1371 ) for single- 
stranded DNA cIAqq polynucleotides (i.e. N = 60 in Eq. Q3J). Theoretically, 
we proceeded as before with (tji — tL)/t max ~ 0.58, k ~ 0.17, and L/v ~ 
0.42 ms. Again, the overall agreement between theory and experiment is 
very nice with the exception of large times 



3 In fact, the authors of Ref. (|37r ) mention that the tail of the experimentally observed 
distribution extends to even much longer times without providing the actual data. 
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As pointed out by an anonymous referee, the latter disagreement can be 
naturally explained by the well-known directionality of the polynucleotide's 
sugar-phosphate backbone, resulting in two different translocation time dis- 
tributions depending on whether the DNA enters the pore with its 3' or 5' 



end first (J3j, LL2|, [16|, I3SH45I) . Denoting the probabilities of 3' and 5' entries 
by p and 1—p and the concomitant two distributions by tpi(t) and ^(t), 
the total (experimentally observed) distributions is given by 

m = pMt) + - p)Mt) ■ (25) 

Within the model of Lubensky and Nelson, ipi(t) and ip2(t) are both of the 
form [TT1 For both of them, the parameter k must be the same according 
to Eqs. El [121 US under the plausible assumption that the effective charge 
q is (approximately) the same for both DNA orientations (see also below 
Eq. EJ). On the other hand, there may in general be two different time 
scales L/v\ and L/v2, and likewise for r in Eq. El Along these lines, 
the best fit to the experimental data of Bates et al. (1371 ) was obtained for 
K — 0.18, L/v\ ~ 0.42 ms, Ljvi — 0.82 ms, and p ~ 0.81. According to Fig. 
El the resulting agreement between experiment and theory is indeed very 
good. We remark that while quantitative experimental estimates for the 
two "extra parameters" p and v\ jv-i of the extended theory [25] do not seem 
available, the "event diagrams" for dA$Q in Fig. 3 of the paper (fl6l ) and 
for gL4ioo in Fig. 2 of the paper (fioh qualitatively compare very favorably 
with our above findings p 0.81 and v\jv>i ~ 2 for dA§Q. We finally remark 



that also in the works ([13|, |40j) two "groups" of cL4ioo translocation events 
were identified, "group 1" containing about 80% of the events (at 20° C) 
(flit ) , and that the two "groups" can be attributed to the two different DNA 



orientations (|40j, |42|) . 

As expected, the already very good agreement in Fig. E] with the rC^o 
data by Butler et al. (fl6l ) could not be notably improved any more by means 
of the extended theory from Eq. [25j This is consistent with the "event 
diagram" for rC^o in Fig. 3 of the paper (fl6l ). evidencing that while the 
RNA may indeed again exhibit two different orientations, the translocation 
time distributions happen to be very similar for both of them. 

We finally return to the experiment of Kasianowicz et al. (fl2l). using 
poly[Uj samples with a nominal length of N = 210 nucleotides. In a later 
work (139l ). the same group used a poly[U] sample whose length distribu- 
tion was specified as N = 150 ± 50 nucleotides. According to a personal 
communication by one of the authors (D. Branton, Harvard University), a 
comparable polydispersity of N = 210 ± 70 may thus be considered as quite 
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plausible also in the earlier work (|12l ). Theoretically, we took into account 



this fact by including on the right hand side of Eq. [25] an integral over a 
Gaussian length distribution. Formally, this is achieved by replacing L in 
the definitions [TBI and [TBI by zL, where z > is Gaussian distributed_| with 
average 1 and standard deviation 1/3, and likewise for the two time scales 
L/v\, L/v2 entering into Eq. [25] Fig. [6J shows our best fit to the experimen- 



tal data of Kasianowicz et al. (1121 ). obtained for k ~ 0.1, L/v\ ~ 0.31ms, 
L/v2 — 1.3 ms, and p ~ 0.58. The agreement is obviously very good, except 
for the leftmost data point in Fig. [6j In fact, there are additional experi- 
mental data points at even smaller t-values with ^-values beyond the range 
displayed in Fig. [6j Similarly as in Fig. 02 they are commonly considered 
to be due to polymers that partially entered the channel but then retracted 
rather than actually traversed the channel 142 ). Such events are 



not covered by our present theory, explaining the disagreement in Fig. [6J 
at small t. Accordingly, the experimental data in Fig. [6j have not been 
normalized to unity but rather so that the agreement with the theory was 
optimal. 

We remark that such "unsuccessful translocation attempts" could be 
sorted out in the experimental data in Figs. H] and [5] thanks to their very 
specific current blockade signatures (j37l ) . Hence the theory explains the data 
even at small t. A similar identification of such events in the experiments 



by Kasianowicz et al. (Il2l ) and by Butler et al. (Il6l ) was apparently not 
possible. 

Returning to Fig. [6j the obtained fit k = 0.1 implies that k.N/4 = 
5.25 and thus with Eqs. [25] and H] that q ~ 0.04 q e . Such an effective 
charge is smaller than the estimate from equation [5] but still reasonably close 
to our previously obtained g-valuea^|. We also remark that by assuming a 
larger polydispersity of N = 210 ± 95, the data from Fig. [6J could be fitted 
practically equally well but now with k = 0.05 and hence q ~ 0.08 q e , and 
likewise for N = 210 ± 120 and any q > 0.5 q e . In contrast, without any 
polydispersity the best fit becomes somewhat worse than in Fig. [6J and 
q ~ 0.015 q e becomes unrealistically small. 

Finally, we also fitted the above extended model to the experimental 
data sets from Figs. [3] and [5] but for any non-negligible polydispersity this 
always resulted in a less good agreement than without any polydispersity. 



4 Since z < is ruled out, the Gaussian must be truncated and properly renormalizcd 
5 Note that also the "blocking currents" may vary substantially for different nucleotides 
and that this may be closely related to variations of q 
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Discussion 

In their seminal work (ji|), Lubensky and Nelson showed that their simple 
one-dimensional stochastic model, Eqs. [T][3j captures many of the exper- 
imental observations on polynucleotide translocation through a-hemolysin 
nanopores. However, the quantitative behavior of the experimental translo- 



cation time distribution of Kasianowicz et al. (|12l ) could not be satisfactorily 
explained. Here, we resolved this long standing problem by taking into ac- 
count that due to various electrokinetic effects the relevant effective charge 
of the nucleotides is substantially reduced compared to their bare (nominal) 
charge. 

A second main point of our work was to show that the model of Lubensky 
and Nelson implies an asymptotically exponential decay of the translocation 
time distribution, in agreement with the experimental results from Refs. 

CM). 



Finally, we compared the complete theoretically predicted translocation 
time distributions with experimental distributions from the literature. Tak- 
ing into account the directionality of polynucleotides, the non-negligible 



polydispersity in the experiment by Kasianowicz et al. dlj), and the fact 
that "unsuccessful translocation attempts" are not covered by the theory, 
the model of Lubensky and Nelson explains the experimental observations 
remarkably well. 

Regarding alternative, more sophisticated descriptions e.g. in term of 
bead-spring models with many degrees of freedom ()19l ). one naturally ex- 
pects that - at least within certain regimes or limits of the various model 
parameters - the main features of the one-dimensional model of Lubensky 
and Nelson should be recovered. Particularly important such features are 
the experimentally observed linear dependence of the mean translocation 
time upon the polymer length (see also Section "Model") and the main 
quantitative characteristics of the observed translocation time distributions, 
most notably their spread and exponential decay. The essential open ques- 
tion with respect to those more sophisticated models is then, whether the 
numerous extra degrees of freedom in order to describe the hydrodynamic 
and entropic effects outside the immediate pore region still play a significant 
role within the above mentioned, relevant model parameter regimes. 



We thank D. Branton for a very helpful e-mail exchange regarding Ref. (|12l). 
This work was supported by Deutsche Forschungsgemeinschaft under SFB 
613 and RE1344/8-1. 
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Figure Legends 

Figure Q] 

Schematic illustration of the experimental set up: polynucleotide chains 
(single-stranded DNA or RNA) translocate through an a-hemolysin protein 
pore due to the externally imposed voltage difference between the electrodes. 

Figure [2] 

The function g{x) obtained by numerically evaluating Eq. [2T1 



Figure [3] 

Line: Theoretical translocation time distribution tp(t) according to Eqs. [TBI 
[18] with k = 0.19 and L/v = 0.2 ms. Symbols: Experimentally observed 
translocation times, adopted from Fig. 5 of Butler et al. (flti ). 



Figure [4] 

Line: Theoretical translocation time distribution tp(t) according to Eqs. UST 
[151 with k = 0.17 and L/v = 0.42 ms. Symbols: Experimentally observed 
translocation times, adopted from Fig. 2 of Bates et al. (1371 ) . 



Figure [5] 

Line: Theoretical translocation time distribution ip(t) according to Eqs. [25] 
(see also main text for details) with k = 0.18, L/v\ = 0.42 ms, Ljvi = 
0.82 ms, and p = 0.81. Symbols: Experimentally observed translocation 
times, adopted from Fig. 2 of Bates et al. ()37l ). 



Figure [6] 

Line: Theoretical translocation time distribution ip{t), accounting for poly- 
dispersity by randomizing Eqs. [25] (see main text) with k = 0.1, Ljv\ = 
0.31ms, L/v2 = 1.3 ms, p = 0.58. Symbols: Experimentally observed 
translocation times, adopted from Fig. 2 of Kasianowicz et al. (|12l ). 
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